A Hybrid Mood Classification Approach for Blog Text

نویسندگان

  • Yuchul Jung
  • Hogun Park
  • Sung-Hyon Myaeng
چکیده

As an effort to detect the mood of a blog, regardless of the length and writing style, we propose a hybrid approach to detecting blog text’s mood, which incorporates commonsense knowledge obtained from the general public (ConceptNet) and the Affective Norms English Words (ANEW) list. Our approach picks up blog text’s unique features and compute simple statistics such as term frequency, n-gram, and point-wise mutual information (PMI) for the SVM classification method. In addition, to catch mood transitions in a given blog text, we developed a paragraph-level segmentation based on a mood flow analysis using a revised version of the GuessMood operation of ConceptNet and an ANEW-based affective sensing module. For evaluation, a mood corpus comprised of real blog texts has been built semi-automatically. Our experiments using the corpus show meaningful results for 4 mood types: happy, sad, angry, and fear.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Experiments with Mood Classification in Blog Posts

We present preliminary work on classifying blog text according to the mood reported by its author during the writing. Our data consists of a large collection of blog posts – online diary entries – which include an indication of the writer’s mood. We obtain modest, but consistent improvements over a baseline; our results show that further increasing the amount of available training data will lea...

متن کامل

An Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification

In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...

متن کامل

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...

متن کامل

A Computational Approach to the Analysis and Generation of Emotion in Text

Sentiment analysis is a field of computational linguistics involving identification, extraction, and classification of opinions, sentiments, and emotions expressed in natural language. Sentiment classification algorithms aim to identify whether the author of a text has a positive or a negative opinion about a topic. One of the main indicators which help to detect the opinion are the words used ...

متن کامل

An Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification

In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006